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Abstract 

This paper considers a class of deploy and search strategies for multi-robot 
systems and evaluates their performance. The application framework used is 
a system of autonomous mobile robots equipped with required sensors and 
communication equipment deployed in a search space to gather information. 
The lack of information about the search space is modeled as an uncertainty 
density distribution over the search space. A combined deploy and search 
(CDS) strategy has been formulated as a modification to sequential deploy 
and search (SDS) strategy presented in our previous work. The optimal 
deployment strategy using Voronoi partition forms the basis for these two 
search strategies. The strategies are analyzed in presence of constraints on 
robot speed and limit on sensor range for convergence of trajectories with cor- 
responding control laws responsible for the motion of robots. SDS and CDS 
strategies are compared with standard greedy and random search strategies 
on the basis of time taken to achieve reduction in the uncertainty density 
below a desired level. The simulation experiments reveal several important 
issues related to the dependence of the relative performances of the strategies 
on parameters such as number of robots, speed of robots, and their sensor 
range limits. 
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1. Introduction 

In nature, we observe groups of animals performing a large number of 
complex tasks by cooperating among themselves in a distributed manner. 
Each member of the group performs a relatively simple task using informa- 
tion available in its neighborhood leading to emergence of a collective behav- 
ior. Flocks of birds, schools of fish, foraging of ants are a few examples of 
this phenomena. Nature seems to solve complicated problems in simple and 
elegant ways. Instead of increasing the complexity/intelligence of an individ- 
ual to solve a complex problem, nature relies on a group of individuals with 
nominal abilities. The outcome of such cooperation is more robust to failure 
of a few of the group members. From a more philosophical perspective, the 
focus is on a society rather than on an individual member. 

The problem of searching for targets in unknown environments has been 



addressed in the literature in the past [18| , [31i , [5j , and |22|. Various search 
strategies available in the literature have been surveyed in [sj. These fun- 
damental works were mostly theoretical in nature and were applicable to a 
single agent searching for single or multiple, static or moving, targets. It 
is likely that the same task can be accomplished more effectively by multi- 
ple searchers. But when multiple agents are involved, coordination between 
them becomes an important issue. Although a centralized controller can 
solve the problem, it has many shortcomings such as requirement of com- 
plete connectivity, large communication effort, etc. Further, failure of the 
central controller leads to failure of the entire system. As discussed earlier, 
most biological systems such as ants, birds, fish etc., have distributed local 
decision making capabilities which, in turn, lead to a useful collective be- 
havior such as swarms, fiocks, schools, etc. Agents taking decisions based 
only on available local information and distributed control law (usually re- 
ferred to as 'behavior' in biological systems), lead to coordination among 
the agents and result in a meaningful collective behavior. Developments 
in areas such as wireless communication, autonomous vehicular technology, 
computation, and sensors, facilitate the use of large number of agents (UAVs, 
mobile robots, or autonomous vehicles), equipped with sensors, communica- 
tion equipment, and computation ability, to cooperatively achieve various 
tasks in a distributed manner. 
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Distributed multi-agent systems have been shown to achieve and main- 
tain formations, move as flocks while avoiding obstacles, etc., thus mimicking 
their biological counterparts. They can also be used in applications such as 
search and rescue, surveillance, multiple source identification, and coopera- 
tive transportation. The major advantages of distributed systems are immu- 
nity to failure of individual agents, their versatility in accomplishing multiple 
tasks, simplicity of agents' hardware, and requirement of only minimal local 
information. At the same time it is important to design distributed con- 
trol laws that guarantee stability and convergence to the desired collective 
behavior under limited information and evolving network configurations. 

1.1. Related Literature 

One class of problems discussed in the literature is that, of optimally 
locating agents or sensors in the domain of interest and belongs to the class 
of locational optimization or facility location problems A centroidal 



Voronoi configuration is a standard solution for this class of problems |11 
where each of the agents is located at the centroid of the corresponding 
Voronoi cell. Cortes et al. [g], 0] use these concepts to solve a spatially 
distributed optimal deployment problem for multi-agent systems. A density 
distribution, as a measure of the probability of occurrence of an event, is 
used, along with a function of the Euclidean distance providing a quantitative 
assessment of how poor the sensing performance is, to formulate the problem. 
Centroidal Voronoi configuration, with centroids of Voronoi cells computed 
based on the density distribution within the cell, is shown to be the optimal 
deployment of sensors minimizing the sensory error. The Voronoi partition 
becomes natural optimal partitioning due to monotonic variation of sensor 



effectiveness function with the Euclidean distance. Schwager et al. [29 
interpret the density distribution of [3] in a non-probabilistic framework and 
approximate it by sensor measurements. Sujit and Beard [s^l present an 
exploration system for multiple unmanned aerial vehicles (UAVs) navigating 
through a simulated unknown region that contains obstacles of unknown 
shape, size, and initial position. They perform Monte-Carlo simulation to 
analyze the effect on area coverage with changes in number of agents, sensor 
range, and communication range. 

Cooperative search bx multiple agents has been studied by various re- 
searchers. Enns et al. [1^] use predefined lanes prioritizing them with the 
target probability. The vehicles cooperate to ensure that the total path length 
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covered by them is minimized while exhaustively searching the area. A dy- 
namic inversion based control law is used to make vehicles track the assigned 
tracks or lanes while considering the maximum turn radius constraint. Spires 
and Goldsmith |30] use space filling curves such as Hilbert curves to cover a 
given space and perform exhaustive search by multiple robots. Vincent and 
Rubin [33| address the problem of cooperative search strategies for UAVs 



searching for moving, possibly evading, targets in a hazardous environment. 
They use predefined swarm patterns with an objective of maximizing the 
target detection probability in minimum expected time and using minimum 
number of UAVs having limited communication range. Beard and McLain 
use dynamic programming methods to develop strategies for a team of coop- 
erating UAVs to visit as many opportunities without collision while avoiding 
hazards, in a search area which contains regions of opportunity and hazards. 
The UAVs have the additional requirement that they should stay within 
communication range of each other. Flint et al. 13| provide a model and 



algorithm for path planning of multiple UAVs searching in an uncertain and 
risky environment using dynamic programming approach. The search area 
is divided into cells and in each cell the probability of existence of a target 
defined. Pfister 25j uses fuzzy cognitive map to model the cooperative con- 
trol process in an autonomous vehicle. In [26j, 35], and 36] the authors use 
distributed reinforcement learning and planning for cooperative multi-agent 
search. The agents learn the environment online and store the information 
in the form of a search map and utilize this information to compute their tra- 
jectory. The agents are assumed to have limited maneuvering ability, sensor 
range and fuel. In 35| the authors show a finite lower bound on the search 



time. Rajnarayan and Ghose 27| use concepts from team theory to formu- 
late the multi-agent search problem as a nonlinear programming problem in 
a centralized perfect information case. The problem is then reformulated in 
a Linear-Quadratic-Gaussian setting that admits a decentralized team theo- 
retic solution. Dell et al. [sl develop an optimal branch-and-bound procedure 
with heuristics such as combinatorial optimization, genetic algorithm and lo- 
cal start with random restarts, for solving constrained-path problems with 
multiple searchers. Jin et al. 17|] address a search and destroy problem in a 
military setting with heterogeneous team of UAVs. Altshuler et al. |2| ex- 
amine the Cooperative Hunters problem, where a swarm of UAVs is used for 
searching one or more evading targets, which are moving in a predefined area 
while trying to avoid a detection by the swarm. Sathyaraj et al. [28^ provide 
a comparative study of path planning algorithm for multiple UAVs used in 
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team for detecting targets and keeping them in its sensor range. Yatsenko et 



al. [37[ discuss problems dealing with cooperative control of multiple agents 
movin g in a region searching for targets. 

In [l6| we have proposed a search strategy namely sequential deploy and 
search (SDS) for multiple agents such as UAVs or mobile robots using Voronoi 
partition with some preliminary results. In this work, an uncertainty density 
was used to model the lack of information about the search area. Search 
agents are deployed optimally maximizing a single step search effectiveness 
and then perform search. Each agent performs search within its Voronoi 
cell where it is most effective. The optimal deployment was shown to be 
a variation of centroidal Voronoi configuration, where each agent is located 
at the centroid of its Voronoi cell with perceived uncertainty density. The 



material presented in this paper is based on the doctoral thesis [15 . 

In this paper, we provide a more detailed account of the SDS strategy 
and propose a modification of the this strategy, named as combined deploy 
and search (CDS) where, the mobile robots perform search in discrete steps, 
as they move toward the centroids. We compare the performance of SDS and 
CDS strategies with standard greedy and random search strategies based on 
simulation experiments. 

1.2. Contribution of the paper 

We address a multi-robot search problem where agents, equipped with 
sensors, search the space Q, a convex polytope in M.'^, the d-dimensional 
Euclidean space. The lack of information about the search space is mod- 
eled as an uncertainty density distribution : Q i— > [0,1]. We denote 
Pit) = {pi,P2, ■ ■ ■ ,Pn}, Pi £ Q, with Pi 7^ pj, whenever i ^ j, as the robot 
configuration at time t. We formulate a combined deploy and search (CDS) 
strategy for multiple mobile robots as a modification to_sequential deploy and 
search (SDS) strategy proposed in our previous work [16|]. The fundamental 
concept is "deploy" and "search". In each search step, the uncertainty is 
reduced as 

0n+i(g) = 0n(g) min{/5(||p, - g||)} 

where, n is the search step count, and /5 : M i— [0, 1] is the function acting 
as a reduction factor for the uncertainty density 0. The function f3 is such 
that, 1 — (3 is the sensor effectiveness function. 
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We consider following multi-center objective function, which when maxi- 
mized, maximizes a single step effectiveness 



where, n is the search step count, Vi is the Voronoi cell corresponding to the 
i-th agent/robot. The critical points of the objective function for a given n is 
shown to be centroids of Voronoi cells with perceived density. Unlike SDS, the 
agents/robots do not wait till getting optimally deployed to perform search. 
The critical points arc used only to get a direction of motion for robots. 
We show, that the CDS strategy can reduce the average uncertainty to any 
arbitrarily small value in finite time. The optimal deployment strategy has 
been analyzed in presence of some constraints on robot speed and limit on 
sensor range for convergence of the robot trajectories with the corresponding 
control laws responsible for the motion of robots. 

Simulation experiments are carried out to validate and compare the per- 
formance of SDS and CDS with two generic strategies, namely, greedy and 
random search. The simulation results indicate that both the proposed search 
strategies perform quite well even when the conditions deviated from the as- 
sumed ones such as sensor range limitations and the CDS strategy leads to 
somewhat shorter and smoother trajectories than those of the SDS strategy 
with the same parameters. 

1.3. Organization of the papers 

The paper is organized as follows. We preview a few mathematical con- 
cepts used in this work in Section 2. The multi-robot search problem is 
discussed in Section 3. Section 4 discusses the combined deploy and search 
(CDS) strategy. The objective function, instantaneous critical points, and 
control law responsible for robot motion are presented in this section. In 
Section 5 we impose a few constraints on robot speed and also limit on sen- 
sor range, propose control laws and provide convergence results. The CDS 
strategy is explained with help of illustrative examples in comparison with 
the SDS strategy in Section 6. A few implementation issues are discussed in 
Section 7. In section 8, we provide and discuss simulation results and the 
paper concludes in section 9. 



■n 



ie{l,2,...,N} 
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2. Mathematical preliminaries 

In this section we preview mathematical concepts from Voronoi partition, 
LaSalle's invariance principle, and Liebniz theorem used in the present work. 

2.1. Voronoi partition 

Voronoi partition [sl, [sl is a widely used scheme of partitioning a given 
space and finds applications in many fields such as CAD, image processing 
and sensor coverage. We use the Voronoi decomposition scheme to partition 
the search space. Here we briefly preview the concept. 

By a partition of a set X we mean a collection of subsets Wi of X with 
disjoint interiors such that their union is X itself. Let Q C M.'^, be a convex 
polytope in M'^, the (i- dimensional Euclidean space. We define the configura- 
tion space as P = {V = {pi,P2, ■ ■ ■ ,Pn}}, Pi ^ Q, position of i-th node in Q. 
The Voronoi partition generated by P G P with respect to Euclidean norm 
is the collection {V^('P)}iG{i,2,...,n} defined as, 

V{V) = {qeQ\ \\q~p^ \\<\\q-Pj ||,Vp, GP} 

The Voronoi cell Vi is the collection of those points which are closest to 
Pi compared to any other point in V. The boundary of each Voronoi cell is 
the union of a finite number of line segments forming a closed C° curve. In 
M^, The intersection of any two Voronoi cells is either null, a line segment, 
or a point. In a general ci- dimensional space, the boundaries of the Voronoi 
cells are unions of convex subsets of at most d — 1 dimensional hyperplanes 
in Mf^ and the intersection of two Voronoi cells is either a convex subset 
of a hyperplane or a null set. Each of the Voronoi cells is a topologically 
connected non-null set. 



2.2. LaSalle's Invariance principle 

Here we state LaSalle's invariance principle 21, 0] used widely to study 
the stability of nonlinear dynamical systems. We state the theorem as in 23 
(Theorem 3.8 in 23|). 

Consider a dynamical system in a domain D 

x = f{x),f:D^W' (1) 

Let V : D ^ M be a continuously different iable function and assume that 
(i) M C -D is a compact set, invariant with respect to the solutions of ([T]). 
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(ii) V" < in M. 

(iii) E : {x : X & M, and V{x) = 0}; that is E is set of all points of M such 
that V{x) = 0. 

(iv) is the largest invariant set in E. 

Then every solution of ([T]) starting in M approaches N as t ^ oo. 

Here by invariant set we mean that if the trajectory is within the set at 
some time, then it remains within the set for all time. Important differences 
of the LaSalle's invariance principle as compared to the Lyapunov Theorem 
are (i) V is required to be negative semi-definite rather than negative definite 
and (ii) the function V need not be positive definite (see Remark on Theorem 
3.8 in Q, PP 90-91). 

2.3. Leibniz Theorem and its generalization 



The Leibniz Theorem is widely used in fluid mechanics [20[, and shows 
how to differentiate an integral whose integrand as well as the limits of inte- 
gration are functions of the variable with respect to which differentiation is 
done. The theorem gives the formula 



— / F{x,y)dx= / —dx + —F{b,y)-—F{a,y) 2 
dyJa(y) J a dy dy dy 



Eqn. ([2]) can be generalized for a (i-dimensional Euclidean space as 
d r „ r dF 



, f F{x,y)dV= f ^dV+ f n{x).u{x)FdS (3) 
dy Jv{y) Jv oy Js 

where, V C R'^ is the volume in which the integration is carried out, dV is 
the differential volume element, S is the bounding hypersurface of V, n(x) 
is the unit outward normal to S and u(x) = ^{x) is the rate at which the 
surface moves with respect to ?/ at x G S. 



3. Multi-robot search 

In this section we discuss the problem addressed in this paper. robots 
perform search operation in an unknown environment. The lack of informa- 
tion is modeled as an uncertainty density distribution over the search space 
Q. The problem addressed in this paper is that of deploying robots in Q 
to collect information, thereby reducing the uncertainty density distribution 
over Q. The problem formulation is stated formally as 
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1. Q C M"^ is a convex polytope and is the search space. 

2. (f) : Q [0, 1] defines the uncertainty density function representing lack 
of information. 

3. robots, equipped with sensors and communication equipment, deploy 
themselves in Q, and gather information, thus reducing the uncertainty. 

4. P{t) = {pi{t),p2{t),...,pN{t)} C Q, with Pi ^ pj whenever i ^ j, 
denotes the configuration of the multi-robot system at time t, Pi{t) de- 
notes the position of the i-th robot at time t. In future, for convenience, 
we drop the variable t and refer to the positions by just pi. 

5. Sensor's effectiveness at a point reduces with distance from the sensor. 

6. Ideally, we are looking for an optimal way of utilizing the robots to 
acquire complete information about Q, and thus have 0(g) = 0, Vg G Q. 

During the search operation, sensors gather information about Q, reduc- 
ing the uncertainty density as, 

0n+i(g) = <i)n{q) min{/?(||pi - g||)} (4) 

where, 0n(g) is the density function at the n-th iteration; /3 : M i— [0, 1] is 
a function of Euclidian distance of a given point in space from the robot, 
and acts as the factor of reduction in uncertainty by the sensors; and Pi is 
the position of the i-th sensor. At a given q & Q, only the robot with the 
smallest l3{\\pi — q\\), that is, the robot which can reduce the uncertainty by 
the largest amount, is active. If any robot searches within its Voronoi cell, 
then the updating function (jl]) gets implemented automatically. That is, the 
function minj{/3(||pj — g||)} is simply M\\Pi — iW), where Pi G Vi. 



In the SDS strategy proposed in [16|, the agents get optimally deployed 



before performing search. In order to maximize the search effectiveness in 
each search step, following objective function was considered to be maxi- 
mized. 

Hn = jQA(f)^{q)dQ 

= /Qmaxig{i,2,...,iv}{(|0n(g) - g||)0n(g)|)}c^Q 
= /Q(0n(g) -minig{i,2,...,iv}{/?(lbi -g||)}0n(g))c?<5 
= E,,e{i,2,...,iV} h - - Q\\))dQ 

where, n is "deploy" and "search" count, Vi is the Voronoi cell corresponding 
to the i-th agent, and Pi E Q is the position of the i-th agent. 
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The search effectiveness function /3 : R i— > [0, 1] is a non-decreasing func- 
tion capturing effectiveness of the sensor. Consider 

/3(r) = 1 - ke-"'', k G (0, 1) and a > 

2 

Here, fce""*" represents the effectiveness of the sensor which is maximum 
at r = and tends to zero as r ^ oo and (3 is minimum at r = (effecting 
maximum reduction in 0) and tends to unity as r — oo (Figure [1]) (change 
in reduces to zero as r increases). Most sensors' effectiveness reduces over 
distance as the signal to noise ratio increases with the distance. Thus j3, 
which is upside down Gaussian, can model a wide variety of sensors with two 
tunable parameters k and a. 




2 4 6 8 10 



Figure 1: The sensor effectiveness and updating function 

The optimal deployment configuration was shown to be a variation of 
centroidal Voronoi configuration, where each robot is located at the centroid 
of its Voronoi cell computed with a density 0„(g) = (f)n{<l)ke~°'^i , which is 
the density as perceived by the sensor, with /3(r) = 1 - ke-'^'' , k e (0,1), 
and a > 0, as search effectiveness function. 

We use Voronoi partition in formulating the search strategies which along 
with advantages can cause some computation overhead (see [1] and references 
therein for time complexity of computing Voronoi partitions). This issue has 
been addressed in the literature (see ^ and references therein) and there are 
a few algorithms that efficiently implement Voronoi partition related compu- 
tations. Also, Voronoi based strategies result in collision free trajectories in 
a natural way, which is an added advantage. 
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3.1. Optimal deployment strategy 

We have shown that the uncertainty reduction will be maximized in a 
single step of search, if the agents are located at the centroids of respective 
Voronoi cells. In SDS, the agents get deployed optimally in this sense before 
performing search. In this section we will provide the control law to make 
the agent move toward the centroids and achieve the optimal deployment 



configuration. Though we have discussed the control law in [16|, we provide 
it here for the sake of completeness and clarity. 

Typically search problems do not consider dynamics of search agents as 
the focus is more on the effectiveness of search, that is, being able to identify 
region of high uncertainty and distribute search effort to reduce uncertainty. 
Moreover, it is usually assumed that the search region is large compared to 
the physical size of the agent or the area needed for the agent to maneuver. In 
this paper, we assume that the agents are modeled as simple first dynamical 
systems as 

Pi = Ui (6) 

Consider the control law 



Ui 



-kpropiPi - CyJ (7) 



Control law ([7]) makes the agents move toward Cy- for positive control 
gain, kprop- 



We have shown in [16|, using LaSalle's invariance principle, that the tra- 
jectories of the agents governed by the control law ([7j), starting from any 
initial condition P(0) G Q'^ , will asymptotically converge to the critical 
points of Hn- 



4. Combined deploy and search (CDS) strategy 

In the SDS strategy. The robots first get optimally deployed and then 
search is performed. The "deploy" and "search" steps continue till the un- 
certainty density is reduced below a desired value. Here the optimal deploy- 
ment strategy ensures that the uncertainty density reduction is maximized 
in each search step. But it does not guarantee optimal trajectories of the 
robot. During the deployment stage, the robots move without utilizing the 
sensors. Intuitively, it seems that the trajectories will be closer to optimal if, 
as the robots are moving toward the respective Cy^, they also simultaneously 
perform the search operation in discrete steps. We define the latency, tg, of 
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the robots as the maximum time taken to acquire the information, process it, 
and successfully update the uncertainty density. The time interval between 
each search should be more than tg- Here we shall formulate such a strategy 
and name it combined deploy and search (CDS) strategy. 

4.1. Density update 

Here we provide the problem formulation for the combined deploy and 
search strategy. Assume that the index n represents the intermediate step 
at which the search is performed and uncertainty density is updated. Using 
the uncertainty density update rule (jlj) discussed earlier we can get. 



A„0(g) = 0„+i(g) - 0„(g) = 0„(g) min(l 



Pi - Q 



(8) 



^ r 



Integrating ([8]) over Q, 



Q 



Aq)dQ 



(9) 



A$„= / - m P^ - Q \\))dQ 



(10) 



4-2. Objective function 

The objective function ([5]), used for SDS strategy [l6|, is fixed for each 
deployment step as (f)n{(l) is fixed for the n-th iteration. In combined deploy 
and search, the search task is performed as the robots move. Now an objective 
function to be maximized in order to maximize the uncertainty reduction at 
the n-th search step is 



i€{l,2,...,N} 



0„(g)(l-/?(|b,-g||))cig 



Note that the above objective function is same as ([5]) except for the fact that 
n in this case represents the search step count, whereas in ([5]) it represents 
'deploy and search' step count. For P{r) = 1 — A;e~"^ , the objective function 
( TTT]) becomes, 

nn= Yl I 0n(g)fce-"^'rfQ (12) 
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It can be noted that for a given n, the uncertainty density 0n(g) at any 
g G Q is constant. The gradient is given as (see Theorem A.l in Appendix), 



dpi 



J2 I 0n(g)A;e-"("P'-«ll)'(-2a)fe-g)rfQ 
ie{i,2,...,Af} "^^^ 

-2aMvXp^-Cv^ (13) 



where and Cy^ are the mass and the centroid of Vi with respect to 4>n{,'i) = 
0„(g)fce~"'"» , which is the density as perceived by the sensor. The critical 
points are same as those obtained for SDS. But the uncertainty changes 
in every time step and hence the critical points also change. Hence, the 
corresponding critical points are only the instantaneous critical points. It 
should be noted that the above treatment is valid for any non-decreasing 
continuously differentiable /?(■) with 0(-) depending on exact nature of the 
function /?(■). We use the control law 

Ui = -kpropiPi - Cv,) (14) 

Control law ([7j) makes the robots move toward Cy- for positive control gain, 
k 

i^prcyp ■ 

The instantaneous critical points and the gradient ( fT3|) are used in control 
law only to make the robots move toward the instantaneous centroids 
rather than deploying them optimally. Thus, it is not possible to prove any 
optimality of deployment and we do not prove the convergence of the trajec- 
tories here in case of CDS. In CDS, compared to SDS, robots perform more 
frequent searches instead of waiting till the optimal deployment maximizing 
per step uncertainty reduction. 

To implement the control law, centroid of each Voronoi cell needs to be 
computed. The computational overhead of computing the centroid can be 
reduced at the cost of slower convergence using methods reported in the 



literature such as random sampling and stochastic approximation [19|, [14 
In addition, we discretize the search space into grids while implementing the 
strategy. This simplifies the computation of the centroid of Voronoi cells. 
The main focus of this paper is design and demonstration of the multi-robot 
search strategy and finer issues such as computation complexities are beyond 
the scope of this paper. 

It can be shown that he combined deploy and search strategy is spatially 
distributed over the Delaunay graph Qd. Here by spatially distributed we 
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mean that information from neighboring robots is sufficient for computation 
of control input. A Delaunay graph Qd is an undirected graph, where two 
agents/robots are said to be neighbors (connected by an edge) if the corre- 
sponding Voronoi cells are adjacent. 

Theorem 1. The combined deploy and search strategy can reduce the aver- 
age uncertainty to any arbitrarily small value in finite time. 

Proof. Consider the uncertainty density update law (jl]) for any q E Q, 

= (1 - ke-"'^')^n-i{q) = 7n-i0n-i(g) (15) 



where, is the distance of point q & Q from the i-th robot, such that q & Vi, 
the Voronoi cell corresponding to it and, 7n-i = (1 — fce~"^* ). 
Applying the above update rule recursively, we have, 

0n(<?) = 7n-l7n-2 • • • llloMl) (16) 

Let D{Q) := maXp^^ggdl p — q ||). We note that 

(i) < A; < 1 

(ii) < Tj < D{Q). D(Q) is bounded as the set Q is bounded, 
(hi) < 7j < 1 - A;e-"{^(<3)n = / (gay), j g N; and / < 1 

Now consider the sequence {T} , 

Tn = 7n7n-l • • • 7l70 < ^"^^ 

Taking limits as n ^ oo, 



lim r„ < lim / 

n— >oo n—too 



n+1 



Thus, 



lim (^„(g) = lim r„_i(/)o((?) = 



n— >oo n— >oo 



As the uncertainty density vanishes at each point g G Q in the limit, the 
average uncertainty density over Q is bound to vanish as n — oo. Thus, the 
average uncertainty density can be reduced to arbitrarily small value in finite 
time. □ 
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It can be observed that the above proof does not depend on the control 
law. The theorem depends only on the outcome of the choice of the updating 
function (jlj), along with the fact that there is no sensor range limitation, and 
that the search space Q is bounded. In addition, the theorem does not 
address the issue of optimality of the strategy which, in fact, depends on the 
control law which is responsible for the motion of the robots. Further, unlike 
SDS, maximal uncertainty reduction is also not guaranteed in each search 
step. In case of SDS, the reduction in the uncertainty in each step in SDS is 



which is the maximum possible reduction in a single step. The deployment 
is such that uncertainty will be reduced to a maximum possible extent in a 
step, given by the above formula, whereas in CDS the uncertainty reduction 
in n-th search step is given by 



where it is not required that pi = Cy- while performing search. Though the 
uncertainty reduction in a given search step n in CDS is less than that in 
SDS, ss will be seen in later sections, the CDS performs better compared to 
SDS in terms of faster uncertainty reduction due to more frequent searches. 

5. Constraints on robot speed and sensor range 

In previous sections we have formulated the multi-robot search strategies 
under ideal conditions and provided a few useful analytical results for conver- 
gence and spatial distributedness providing an analytically sound platform 
for analysis. But in a practical situation these conditions may be violated. It 
is more likely that the robots will have limit on maximum speed or they may 
be constrained to move with a constant speed. Further, the sensors could 
have limit on their range. In this section we analyze the proposed strategies 
in the presence of speed and sensor range limitations. 

5.1. Maximum speed constraint 

Let the robots have a constraint on maximum speed of Umaxi, for i = 
1, . . . , n. Now consider the control law 




(17) 




(18) 
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''^=\_U (P^-Cv,) otherwise ^^^^ 

The control law (IT^ makes the robots move toward their respective cen- 
troids with saturation on speed. 

Theorem 2. The trajectories of the robots governed by the control law / flPj) . 
starting from any initial condition V{0) G , will asymptotically converge 
to the critical points of Tin- 

Proof. Consider V{V) = -Hn- 



dt Z-^ie{l,2,...,N} dp, I'l 

2« EiG{i,2,...,m ^vAPi - CvJ{-kprop){Pt - CyJ, If Ui < U, 

' ' ' ~ ~ {Pi—C ) 

2«EiG{l,2,...,7V} Mv,{Pr - CyJ(-f/„^ax^)^^Jzc^?|^' 
—2akprop Eie{l,2,...,Ar} 

MvX\\p^-Cv^\)^ \iu,< Umaxi 

— ( Hp ■ —C II )^ 
-2a Eie{i,2,...,jv} f^^^^^^v-.^^^^;^, otherwise 



max I 



(20) 

We observe that 

1. : Q I— i> M is continuously differentiable in Q as {Vi} depends at 
least continuously on P, and V is continuous as u is continuous if not 
smooth. 

2. M = Q is a compact invariant set. 

3. y is negative definite in M. 

4. E = V-\0) = {Cy^}. 

5. E itself is the largest invariant subset of E by the control law fll9l) . 

Thus, by LaSalle's invariance principle, the trajectories of the robots gov- 
erned by control law f|T9|) . starting from any initial configuration V{0) G Q^, 
will asymptotically converge to the set E, the critical points of 7i„, that is, 
the centroidal Voronoi configuration with respect to the density function as 
perceived by the sensors. □ 
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5.2. Constant speed control 

The robots may have a constraint of moving with a constant speed Ui. 
But we let the robots slow down as they approach the critical points. For 
i = 1, . . . ,n, consider the control law 

-UiT^—^, if lb* -Cv\\>6 
—Ui{pi — otherwise 

where, 5 > 0, predefined value, such that the control law fl?Il) makes the 
robots move toward their respective centroids with a constant speed of Ui 
when they at a distance greater than 5 from the corresponding centroids and 
slow down as they approach them. 

Theorem 3. The trajectories of the robots governed by the control law ^21\} . 
starting from any initial condition P(0) G , will asymptotically converge 
to the critical points of Tin. 

Proof Consider V{V) = -Tin- 



V{V) 



We observe that 



-2" E»e{i,2,...,JV} U^Mv,^-^^^-^^, 

if \\pi - CyJI > 5 

-2ttEie{i,2,...,iV} UiMvM - C'fJA 
otherwise 



(22) 



1. : Q 1-^ M is continuously differentiable in Q as {V^} depends at 
least continuously on V, and V is continuous as u is continuous if not 
smooth. 

2. M = Q is a compact invariant set. 

3. y is negative definite in M. 
A. E = V-\Q) = {Cy^}. 

5. E itself is the largest invariant subset of E by the control law fl2Tl) . 

Thus, by LaSalle's invariance principle, the trajectories of the robots gov- 
erned by control law fl2T]) . starting from any initial configuration V{0) G Q'^ , 
will asymptotically converge to the set E, the critical points of 7i„, that is, 
the centroidal Voronoi configuration with respect to the density function as 
perceived by the sensors. □ 



17 



5.3. Effect of sensor range limits 

In reality, it is likely that the sensors will have limitations on their range. 
The sensors, in addition to having a monotonically decreasing effectiveness 
with Euclidean distance, might be totally insensitive to signals at distances 
larger than i?, the sensor range limit. It can also be thought of as follows: 
when the effectiveness falls below, say, a certain threshold value, for all prac- 
tical purposes, it can be assumed to be ineffective. 

In order to incorporate the sensor range limit, we need to modify the 
objective function (jS]). We do so by modifying [3 suitably. 

Proposition 1. Let (3{r) and (3{r) = c + P{r), where c is a real constant, be 
two sensor detection functions. The corresponding objective functions 7i(P) 
andTiiV), respectively, have the same critical points. 

Proof. The objective function with sensor detection function (3 is 
= E.e{i,2,...,7V}/y,0n(g)(l-/3(lb.-g||)W 

= EiG{l,2,...,iV} !v,<Pn{.q)il- |3{\\P^- q\\) - c)dQ 

= EiG{i,2,...,7V} 0n(g)(l - PiWPi - q\\))dQ - cEig{i,2,...,7V} /y, Mq)dQ 

= ^ - c ^2^e{l,2,...,N} Iv. Mq)dQ 

The second term not being a function of Pi, we have, 

opi dpi 

□ 

Consider the objective function with a saturation on [3. Let 

I otherwise 

The sensor detection function is shown in Figure [2] with dotted curve for 
R = 6. For r > 6, the sensor detection function remains fixed at /3(6). 
Consider the objective function 

n= Yl I <Pn{q){l-M\p^-q\\))dQ (24) 



18 



It is easy to show that the gradient of the objective function with the new 
update function /3 is given by, 

dijK) 

(P) = 2M(y^nB(p„i?))(C(y,n5(p,,i?)) - Pi) (25) 

where, the mass M and the centroid C are now computed within the region 
{Vi n B{pi,R)), that is, the region of Voronoi cell Vi, which is accessible to 
the i-th robot. The critical points are pi = C(y^f^B(:pi,R))- 

However, in reality, the detection function should become one beyond the 
range R (that is there is no reduction in uncertainty at points which are at 
a distance more than R from the sensor. To achieve this, let 

A. . \m + (1 - m)) = 1 - k{e-"'" - e-"«'), if r < 

f^{n = < . (26) 

I 1, otherwise 

= /3(r) + (l-/3(i?)) 

Figure [2] shows the sensor detection function jS in solid line for R = 6. After 
r = 6, the limit on sensor range, the sensor is ineffective indicated by the 
value 1. There will not be any reduction in uncertainty density in this region 
r > 6. 

Now by Proposition [H HiV), the objective function corresponding to the 
detection function (3, has the same critical points as those of the objective 
function 7i{V). 

The control law making robots move toward the new critical point is 

''^i = ~kprop{Pi — C'(y.nB(pi,R))) (27) 

Remark 4- The control law (I27|) is spatially distributed under the r-limited 
Delaunay graph Qld, for any robot configuration V. 



Theorem 4. The trajectories of the robots governed by the control law (21), 
starting from any initial condition V{0) G , will asymptotically converge 
to the critical points ofH. 

Proof. The proof is based on LaSalle's Theorem similar to those of The- 
orems [2] and [3] with V = —H{V). It can be shown that V is continuously 
different iable using Theorem 2.2 in [^. □ 
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r 



Figure 2: Illustration of /? and (3 in presence of the limit on sensor range. The dotted curve 
represent the sensor detection function /3 and solid curve is the actual sensor detection 
function (3{r) = f3{r) + (1 - (3{R)) 
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6. Illustrative examples 



In this section we show some simulation results to illustrate the CDS 



strategy in comparison with SDS [16|]. More detailed simulation results will 
be presented in a later section. 

Figure [3] (a) and (b) compares the trajectories of robots with SDS and 
CDS strategies. The trajectories with CDS are much smoother and shorter. 
The instances of search are indicated by 'o' along the trajectories. It can be 
seen that the search is performed at every discrete step in CDS, whereas the 
search is performed only after each optimal deployment SDS. Though there 
are 8 "deploy and search" steps in SDS, only 5 'o's are visible. In two of 
steps, multiple searches have been performed as the centroids in successive 
steps were closer than some tolerance limit dtoi = 0.3. Thus, there was no 
movement in corresponding deployment step. 

Figure [3](c) compares the history of uncertainty density of SDS and CDS, 
and it can be observed that the CDS reduces uncertainty relatively faster than 
SDS, in terms of number of time steps. Figured (d) shows the reduction in 
average uncertainty density with number of searches for SDS and CDS. It can 
be observed from this figure that SDS reduces the uncertainty in relatively 
fewer steps. This is apparent by very concept of optimal deployment in SDS. 
CDS takes about 4 searches to reduce uncertainty below 0.1, whereas SDS 
does this in only 3 searches. If SDS requires over 30 time steps to achieve 
this reduction, CDS needs 4 time steps. We can observe a tradeoff between 
the number of searches and and number of time steps required to accomplish 
in CDS and SDS. Once the uncertainty reduces to a large extent in initial 
search steps, by nature of the uncertainty density update rule amount 
of reduction in subsequent searches is less in both SDS and CDS. 

Figure m^a) illustrates 'robot 2' moving toward centroid corresponding to 
its Voronoi cell with SDS strategy. Robot's positions are marked with '+' 
while 'o' marks the centroids at successive time instances. Positions of robot 
in first two time steps are marked as 1 and 2, while those of centroids marked 
with 1' and 2'. It can be observed that the robot is tracking the centroid, 
which is changing as the Voronoi cell is changing. Deployment stops and 
search is performed when the robot is sufficiently close to the corresponding 
centroid. One of the search instances is also marked, where, after search, in 
order to track the next centroid, the robot takes an abrupt turn. This leads 
to a non-smooth trajectory. Figure H] (b) illustrates one of the robots moving 
toward centroid corresponding to its Voronoi cell with CDS strategy. It can 
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1 2 3 4 5 6 7 8 9 10 



(a) 




time steps 




I 2 3 4 5 6 7 8 9 10 



(b) 




number of searches 



(d) 



Figure 3: Trajectories of robots with A^=5 and without hmit on the sensor range for (a) 
SDS strategy and (b) CDS strategy. In both cases the points marked '+' indicate the 
starting locations of robots and 'o' indicate the end of deployment and points in space 
where search is being performed. In SDS, at a few places the search is performed more than 
once. This is indicated in trajectory of 'robot 1'. The reduction in average uncertainty 
density is shown in (c) against the number of time steps and (d) against the number of 
searches, for SDS and CDS. Even in (c) and (d), 'o' indicate the search instances. 
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(a) (b) 

Figure 4: Figure illustrating the process of a robot, following the respective centroid (a) 
with SDS and (b) with CDS. In both cases the robot location in each time step is shown 
by '+', and corresponding centroid locations are shown by 'o'. 
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be observed that the robot is tracking the centroid, which is changing as the 
Voronoi cell and the uncertainty density are changing. 



7. Implementation Issues 

Here we discuss some of the theoretical and implementation issues in- 
volved in combined deploy and search strategy. 

7.1. Spatial distributedness 

Here we discuss the implication of spatial distributedness of the proposed 
search strategies from a practical point of view. We have seen that both 
the search strategies are spatially distributed in the Delaunay graph. These 
results imply that all the robots need to do computations based on only 
local information, that is, by the knowledge about position of neighboring 
robots. Also, the robots should have access to the updated uncertainty map 
within their Voronoi cells. This can be achieved in several ways. One such 
way is that all the robots communicate with a central information provider. 
But it is not necessary to have this global information. The i-th robot can 
communicate with its Voronoi neighbors {{J^g{i)) and obtain the updated 
uncertainty information in a region Uxg(i)Vi. As the Voronoi partition {Vi} 
depends at least continuously on V, the robot configuration in an evolving 
Delaunay graph, the communication within the neighbors is sufficient for 
each robot to obtain the uncertainty within its new Voronoi cell. The issues 
related to communication of uncertainty information are not addressed in 
the paper except to assume that uncertainty information is available to the 
robots. It is also possible that the robots can estimate the uncertainty map 



as done in 29 



In practical conditions, the robots can communicate with other robots 
only when they are within the limits of the sensor range. The Delaunay 
graph does not allow sensor range limitations to be incorporated. We need 
to use r-Delaunay graph Qld to incorporate the sensor range limitations. 
The scenario changes with incorporation of sensor range limitations into the 
search strategies. The updating of uncertainty density will also be within 
the sensor range limits (in fact, it is within the region fl i?)). The 
centroid that is computed will also be within the new restricted area. For an 
optimal deployment problem, from the perspective of sensor coverage, it has 
been observed that the corresponding control law is still spatially distributed 
(in r-limited Delaunay graph) and globally asymptotically stable. 
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1.2. Synchronization 

Synchronization plays an important role in multi-agent systems. Here we 
discuss this issue for both SDS and CDS strategies. In the case of SDS, the- 
oretically all the robots reach the respective centroid at infinite time. But in 
a practical implementation, the robots are required to be sufficiently close, 
where the closeness is suitably defined, to the respective centroids before 
starting the search operation. It is possible that at any point in time, differ- 
ent robots are at different distances from the corresponding centroid. The 
robots need to come to a consensus as to when to end the deployment and 
perform search operation. We have implemented the strategy in a single 
centralized program using MATLAB. In a practical situation, synchroniza- 
tion can be attained by robots communicating a flag bit indicating if a robot 
has reached its centroid or not. When all the robots have reached the re- 
spective centroid within a tolerance distance, the search can be performed. 
We also assume that sensing and communication are instantaneous. In our 
simulation experiments we assume such a communication exists. Since the 
objective of this work is to evaluate the effectiveness of the search strate- 
gies, we make assumptions that simplify implementation without affecting 
the search effectiveness. 

CDS operates in a synchronous manner by design. If all the robots start 
at the same instant of time and have synchronized clocks, the search task is 
performed by every robot after the same interval of time. Given an accurate 
global clock, synchronization is not a major issue in case of CDS. 

Further in j^, authors provide an asynchronous implementation for cov- 
erage control which can be suitably modified for CDS to operate asyn- 
chronously. 

8. Results and Discussion 

In this section we present simulation experiment results to compare the 
performance of SDS and CDS strategies with greedy search and random search 
tailored to suit the problem setting addressed in this work. Random and 
greedy are generic strategies which can be tailored to suit any problem setting. 
We restrict the robots to move at a constant speed and also discretize the 
heading direction of robots at 1 degree resolution. Below, we discuss simple 
strategies used for the purpose of comparing the performance. 

Typically search problems do not consider dynamics of search robots as 
the focus is more on the effectiveness of search, that is, being able to identify 
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region of high uncertainty and distribute search effort to reduce uncertainty. 
Moreover, it is usually assumed that the search region is large compared to 
the physical size of the robot or the area needed for the robot to maneu- 
ver. We assume a first order dynamics for the robots for the purpose of 
simulations. 

Greedy search 

We discuss two simple greedy strategies. In the CDS, the z-th robot moves 
toward the centroid of Vif\B{pi, R)) with as density. The Voronoi partition 
takes care of the coordination among the neighboring robots. In greedy 
search we simply let the i-th robot move toward corresponding centroid of 
B{pi,R). Thus there is no cooperation between robots. 

According to the update rule (jl]), only the robot which is most effective 
will perform search task at any given point g G Q. This leads to the idea of 
each robot performing search task within its own Voronoi cell. We call such 
a greedy strategy as Voronoi greedy search (VGS) strategy. In this strategy, 
only the control law is greedy whereas the search is performed in a cooperative 
manner. In case of cooperative search, each robot discards information about 
the area which is better accessible to its neighboring robots. 

In a true greedy search strategy we expect no cooperation among robots 
at any point in time. A true greedy strategy (TGS) can be achieved if each 
robot performs search task independently at every point within B{pi,R), 
leading to duplication of the search task. The update law takes the form, 

0n+l(g)=0n(g) n (^(\\P^-^\\) (28) 

{i\p,.eB{q,R)} 

It should be noted that the idea of collecting information by all sensors in 
contrast to using the information from the most effective sensor is problem 
dependent. If the sensor is a camera, then many cameras taking picture of 
the same area may not add to the information. 

Random search 

Random search (RS) is probably the simplest search strategy. Here we 
assume the robots move with a constant speed and the direction of the robots 
is generated randomly. The uncertainty density updating can be given either 
by (jlj) or by fl28|) . We have used the update law given by (jlj) in this case. 

In TGS, VGS, and RS strategies, the search task is performed in every 
time step as in CDS. 
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Simulation results for performance comparison 

The comparison is based on the number of time steps required to reduce 
the average uncertainty below a specified value. 

The parameters which were varied during the simulations are the number 
of robots (A^), the sensor range limit (R), and the speed of robots (U). It 
was desired that the average uncertainty density should be reduced to a value 
below 0.002. 

Table I compares the performance of the search strategies discussed in this 
paper in terms of the number of searches performed before the termination 
condition is reached. The first column gives the parameters A^, R, and 100 xt/ 
as N.R.IOOU. Thus, 20.4.10 means N = 20, R = A, and U = 0.1. From 
Table I, it can be observed that in most cases, the combined deploy and 
search strategy performs better followed by true greedy strategy, Voronoi 
greedy strategy, and random search in order of degrading performance as 
indicated by the number of time steps. In terms of the number of searches 
being performed, sequential deploy and search strategy performs better than 
combined deploy and search strategy. 

The numbers in Table I indicate that with an increase in N and R the 
performance of all the strategies improves. Figure [6](a) illustrates this. In 
this figure, the points have been interpolated using a shape preserving inter- 
polation scheme. 

Performance of CDS strategy deteriorates at higher speeds and lower limit 
on sensor range as illustrated in Figure O^b) for combined deploy and search. 
The robots do not move if the centroid of respective Vi fl B{pi,R) is closer 
than U/2, the tolerence set in the program. With large U and small R, quite 
often this condition restricts the motion of robots. The condition adversely 
affects the performance of CDS as the region Vi fl B{pi, R) is always smaller 
than the region B{pi,R), which is used by greedy strategies for computing 
the centroid. That is, in case of CDS, it is more likely that the centroids 
are closer than f//2 to robots. In case of greedy strategies, the robots need 
to move toward the centroids of corresponding B{pi,R). This region being 
relatively larger, it is less likely that the robots will be closer to centroids. It 
has been observed during simulations, particularly with CDS, that many of 
the robots do not move during the entire search operation. When number of 
robots is lower, difference in areas of above two regions is more likely to be 
higher. 

Figures [7] (a) and (b) show the time history of the average uncertainty 
density distribution for combined deploy and search, true greedy, Voronoi 
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Table 1: Comparison of performance of various multi-robot search strategies (CDS: Com- 
bined deploy and search; SDS: Simultaneous deploy and search; VGS: Voronoi greedy 
search; TGS: True greedy search; RS: Random search. The first column gives the param- 
eter of simulation experiments as (N).(R).(100U). The numbers indicate the number of 
search tasks performed before reaching the termination condition. The maximum number 
of steps was restricted to 2000. The figures within parentheses for the case sequential 
deploy and search are the actual times required for achieving the task. 
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Figure 6: (a) Number of time steps vs. sensor range R for combined deploy and 
search with a speed of 0.25 for different number of robots (b) Number of time steps 
versus robots' speed U for combined deploy and search for 20 robots with sensor 
range R = 3 (c) Number of time steps vs. robots' speed U for random search for 
20 robots with sensor range R = 4. 



greedy, and random search strategies for two different sets of parameters, one 
witfi N = 5, R = 2 units, and U = 0.5 units and anotlier witli N = 20, i? = 4 
units, and U = 0.5 units. Both the figures illustrate that the combined deploy 
and search takes minimum time steps while true greedy, Voronoi greedy, and 
random search strategies follow in order of degrading performance in terms 
of number of time steps. It should be noted though all these strategies 
perform search task in every time step, CDS performs better because of 
better coverage of the search space due to cooperation among the robots 
through the Voronoi partition. The CDS reduces uncertainty, in most cases, 
more than the other strategies (TGS, VGS, and RS) for same number of 
searches. 

Figures M (a) and (b) show the trajectories of robots for the above sets 
of parameters. Figures iQlfTTl show the robot trajectories with Voronoi greedy, 
true greedy, and random search strategies, respectively, for the same sets of 
parameters. 

With the greedy strategies, when the parameters are such that the con- 
vergence is slower, it is more likely that robots start moving together in 
clusters. This happens due to lack of cooperation between robots and each 
of them moving toward points with higher uncertainty density. Once this 
happens, the robots move together, and in case of Voronoi greedy search 
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Figure 7: Average uncertainty density distribution with R — 2, U ~ 0.5 and (a) with 5 
robots and (b) with 20 robots. 

where only one of the robot is active at a given point, the basic purpose 
of deploying multiple robots for search is defeated, leading to a much slower 
convergence. The strategy is equivalent to a single robot searching the space. 
With true greedy search strategy, though all the robots perform search within 
their sensor range, only in a very few cases, the performance is comparable 
to combined deploy and search. Cooperation among robots in combined de- 
ploy and search and sequential deploy and search strategies ensure a better 
coverage of the space and sharing of search load among robots. 

As expected, random search does not perform well when the speed is 
less. But as the robot speed is increased, the random search performs better. 
Unlike in other strategies, in case of the random search strategy, the perfor- 
mance always increases with speed as illustrated in Figure [n](c). We compute 
the average length of the trajectory of a robot by multiplying the time, which 
is equal to the number of steps with the speed. Figure [6](c) also shows the 
average trajectory length of robots indicating that after a speed of 1 unit, 
average trajectory length is almost a constant value. It is interesting to note 
that the random search strategy performs better than all other strategies at 
high robot speeds. This is due to better coverage of the search space by 
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Figure 8: Trajectories of robots with combined deploy and search strategy for U — 0.5 
and (a) 5 robots with sensor range R = 2, (b) 20 robots with R = 4. '+', indicate initial 
positions of robots. 

robots. Motion of robots are restricted by the specified control law in case 
of other strategies, which does not happen in case of random search. 

In the case of sequential deploy and search strategy, as the robots ap- 
proach the optimal deployment, we allow partial stepping to enable robots 
to move to as close as possible the respective centroids. Figure fT2] shows robot 
trajectories for the sequential deploy and search strategy. With a small sen- 
sor range, the time required for completing search is more than that for the 
rest. But as the sensor range is increased, there is a continuous improvement 
in the performance leading to the fastest convergence amongst the compared 
strategies. Note that in the case of the rest of the strategies, the time taken 
is the same as the number of search steps. The performance of sequential 
deploy and search strategy seems to monotonically increase with A^, R and 
U. 

In our formulation of the objective function, we do not attach any cost 
to the search operation. This is the basic motivation for strategies such 
as combined deploy and search. In such a situation we observe that the 
combined deploy and search is best suited for reducing uncertainty. But in a 



31 




(a) (b) 

Figure 10: Trajectories of robots with true greedy search strategy for (a) N = 5, R 
and U — 0.5 (b)iV = 20, R = A and U = 0.5. '+', indicate initial positions of robots. 
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(a) (b) 

Figure 11: Trajectories of robots with random search strategy for (a) iV = 5, i? = 2 and 
U = 0.5 (b)A^ = 20, R — A and U — 0.5. The initial positions of robots are as in Figure 
[8l The number of points in each trajectory is 814. 
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(a) (b) 

Figure 12: Trajectories of robots with sequential deploy and search strategy for (a) TV = 5, 
i? = 2 and J7 = 0.5 (b)A^ = 20, i? = 4 and U = 0.5. '+', indicate initial positions of 
robots and 'o' indicate end of deployment and points along the trajectories where search 
is performed. 
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practical situation, some cost may be associated with each search operation, 
in which case sequential deploy and search strategy is more efficient. True 
greedy is effective in certain parameter ranges and when in a situation where 
the duplication of information by more than one sensor is beneficial. Voronoi 
greedy is always below both combined deploy and search and true greedy in 
performance in terms of total time. 

It can be noted that, though the Voronoi partition based strategies pro- 
posed in this paper are computationally complex, their performance is bet- 
ter than the simple strategies such as true greedy and random search. The 
Voronoi based strategies result in collision free trajectories in a natural way, 
whereas non- Voronoi based search strategies need additional coUision avoid- 
ance schemes and require additional computation. 

9. Conclusion and future work 

The problem of multi-robot search in an unknown environment with a 
known uncertainty probability distribution function is addressed in this pa- 
per. A multi-robot search strategy namely combined deploy and search (CDS) 
was presented as a modification of sequential deploy and search (SDS) strat- 
egy proposed in our previous work. We have shown that the centroidal 
Voronoi configuration with respect to the density as perceived by the sensors 
are the instantaneous critical points of the objective function maximizing 
the single step search effectiveness, if the robots are located at corresponding 
centroids. It was shown, that the CDS strategy can reduce the average uncer- 
tainty to any arbitrarily small value in finite time. The optimal deployment 
strategy has been analyzed in presence of some constraints on robot speed 
and limit on sensor range for convergence of the robot trajectories with the 
corresponding control laws responsible for the motion of robots. The pro- 
posed CDS strategy was explained with help of a illustrative example, in 
comparison with SDS strategy. 

Based on a set of simulation experiments, the performance of SDS and 
CDS strategies have been compared with standard strategies such as greedy 
and random search with a constant speed constraint on robots and sensor 
range limit. The parameters A^, the number of robots, R, the sensor range 
limit and U, the speed of robots were varied to compare the performance 
of different strategies. The simulation results indicated that the combined 
deploy and search strategy is best suited when search task does not involve 
any cost. The true greedy search strategy performed well in a few parameter 
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ranges and is acceptable only in a situation where gathering of information 

by more than one sensor in an area makes sense. When a cost is associated 
with the search task, sequential deploy and search strategy was found to be 
more suitable. 

Though the Voronoi partition based strategies such as SDS and CDS 
are computationally complex, their performance is better than the simple 
non- Voronoi based strategies such as true greedy and random search. The 
Voronoi based strategies result in collision free trajectories in a natural way, 
whereas non- Voronoi based search strategies need additional collision avoid- 
ance schemes and require additional computation. 
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Theorem A.l: The gradient of the multi-center objective function 
with respect to Pi is given by 

dHr, f . d 



dpi 



[ <p{q)^{l-f3{r,))dQ (29) 

JVi ^Pi 



where ri = \\ pi — q\\. 
Proof. Rewrite ([5]) as 

71= ^ (30) 

ie{i,2,...,Af} 

where W = Jy^{f{r))^{q)dQ. Now, 

E ^ (31) 

where, /(■) = 1 - /3(-). 

Applying the general form of the Leibniz theorem ([2 

+ EjGiv, L,, ni,(g).Ui,(g)0(g)/(||g -pilD^g (32) 
+ Eie7v, L,, %i(9)-Uii(g)0(g)/(||g -PjIDcig 

where, 

1. A^j is the set of indices of agents which are neighbors of the i-th agent 
in the Delaunay graph Qld- 

2. Aij is the part of the bounding surface (line segment in two dimensional 
case) common to Vi and Vj. 

3. njj(g) is the unit outward normal to Aij at g G Aij. Note that iiij{q) = 
-nji{q), Vg G Aij. 

4. Uij{q) = -^^(g), the rate of movement of the boundary at g G Aij with 
respect to pi. Note that Ujj(g) = Ujj(g). 

5. Note also that f{\\q-Pi\\) = /(||g-Pj||), Vg G Aij, as ||g-Pi|| = \\q-Pj\\, 
by definition of the Voronoi partition. 

By (3)-(5) above, it is clear that for each j G Aj, 



nij{q).u^j{q)(t){q)f{\\q - pi\\)dQ = - nji{q).Uji{q)(f){q)f{\\q - pj\\)dQ 
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Hence, 

/ 4>{q)^{h-Pi\\)dQ 

OPi Jvi OPi 

□ 

It can be observed that necessary smoothness conditions are vahd as 

(i) 0, / G 

(ii) At the boundaries of the Voronoi cells, the objective function does not 
have any jumps as /(rj) = /, at the boundary, by definition of the 
generalized Voronoi partitions. 

(iii) The Voronoi partition itself varies smoothly with V. 
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